Noisy speech recognition by using output combination of discrete-mixture HMMs and continuous-mixture HMMs
نویسندگان
چکیده
This paper presents an output combination approach for noiserobust speech recognition. The aim of this work is to improve recognition performance for adverse conditions which contain both stationary and non-stationary noise. In the proposed method, both discrete-mixture HMMs (DMHMMs) and continuous-mixture HMMs (CMHMMs) are used as acoustic models. In the DMHMM, subvector quantization is used instead of vector quantization and each state has multiple mixture components. Our previous work showed that DMHMM system indicated better performance in low SNR and/or non-stationary noise conditions. In contrast, CMHMM system was better in the opposite conditions. Thus, we take a system combination approach of the two models to improve the performance in various kinds of noise conditions. The proposed method was evaluated on a LVCSR task with 5K word vocabulary. The results showed that the proposed method was effective in various kinds of noise conditions.
منابع مشابه
Discrete-Mixture HMMs-based Approach for Noisy Speech Recognition
It is well known that the application of hidden Markov models (HMMs) has led to a dramatic increase of the performance of automatic speech recognition in the 1980s and from that time onwards. In particular, large vocabulary continuous speech recognition (LVCSR) could be realized by using a recognition unit such as phones. A variety of speech characteristics can be modelled by using HMMs effecti...
متن کاملNoisy Speech Recognition with Discrete-Mixture HMMs Based on MAP Estimation
In this paper, we develop a novel modeling scheme for discrete-mixture HMMs (DMHMMs) by using maximum a posteriori (MAP) estimation. Also the MAP estimated DMHMMs are used for speech recognition to improve the accuracy under noisy conditions. The DMHMMs were originally proposed to reduce calculation costs in decoding process [1][2]. We propose a new method for MAP estimation of DMHMM parameters...
متن کاملEfficient speech recognition using subvector quantization and discrete-mixture HMMs
This paper introduces a new form of observation distributions for hidden Markov models (HMMs), combining subvector quantization and mixtures of discrete distributions. We present efficient training and decoding algorithms for the discretemixture HMMs (DMHMMs). Our experimental results in the airtravel information domain show that the high-level of recognition accuracy of continuous mixture-dens...
متن کاملReviving discrete HMMs: the myth about the superiority of continuous HMMs
Despite what is generally believed, we have recently shown that discrete-distribution HMMs can outperform continuousdensity HMMs at significantly faster decoding speeds. Recognition performance and decoding speed of the discrete HMMs are improved by using product-code Vector Quantization (VQ) and mixtures of discrete distributions. In this paper, we present efficient training and decoding algor...
متن کاملHybrid Training Method for Tied Mixture Density Hidden Markov Models Using Learning Vector Quantization and Viterbi Estimation
In this work the output density functions of hidden Markov models are phoneme-wise tied mixture Gaussians. For training these tied mixture density HMMs, modiied versions of the Viterbi training and LVQ based corrective tuning are described. The initialization of the mean vectors of the mixture Gaussians is performed by rst composing small Self-Organizing Maps representing each phoneme and then ...
متن کامل